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Abstract. We say that a Cellular Automata (CA) is coalescing when 
its execution on two distinct (random) initial configurations in the same 
asynchronous mode (the same cells are updated in each configuration at 
each time step) makes both configurations become identical after a rea- 
sonable time. We prove coalescence for two elementary rules and show 
that there exists infinitely many coalescing CA. We then conduct an ex- 
perimental study on all elementary CA and show that some rules exhibit 
a phase transition, which belongs to the universality class of directed 
percolation. 



1 Introduction 

The coalescence phenomenon, as we call it, has been observed for the first time 
by Nazim Fates PP , in the context of asynchronous cellular automata. Coalescing 
CA exhibit the following behavior: starting from two different initial random con- 
figurations and running the same updating sequence (the same cells are updated 
at each time step in both configurations), the configurations quickly become 
identical, i.e. the dynamics not only reach the same attractor, they also synchro- 
nize their orbits. This of course appears in trivial situations, for example if the 
CA converges on a single fixed point, but Nazim Fates has also observed it in a 
case where the coalescing orbit is absolutely non trivial. 

The goal of this paper is to explore this rather strange emergent phenomenon 
in which the asymptotic behavior seems to be only related to the (random) 
sequence of update of the cells and not to the initial configuration. This work 
shows that, in some cases, the randomness used during evolution is as important 
as the one used during initialization: this stochastic dynamic, with high entropy, 
is perfectly insensitive to initial condition (no chaos here). 

The results presented here are of two kinds. First, we prove the existence 
of infinitely many different (we precise this notion) non trivial coalescent CA. 
Secondly, we study by simulation the behavior of all elementary CA (ECA) with 
regards of this coalescence property, in an asynchronous context in which at each 
step each cell has a fixed probability a to be updated. We show that over the 
88 different ECA, six situations occur: a/ 37 ECA never coalesce; b/ 20 always 
coalesce in a trivial way (they converge to a unique fixed point); c/ 6 always 
coalesce on non trivial orbits; d/ 14 combine a/, b/ and c/ depending on a (4 
combine a/ and b/, 3 combine b/ and c/, and 7 combine a/ and c/); e/ 7 enter 



either full agreement (coalescence) or full disagreement; last, f/ 4 combine e/ 
with either a/, b/ or c/. 

We also study the transition between non coalescence and coalescence when a 
varies for the EC A that combine a/ and c/: there is a phase transition belonging 
to the universality class of directed percolation. We thus get a new model of this 
class, with a few variants. An unusual fact among directed percolation models 
is that the limit of the sub-critical regime is neither a single absorbing state, 
nor a set of fixed points, but a non trivially evolving phase. Its originality and 
links to other domains could help understanding this class and hopefully lead to 
analytical results. 

The paper is organized as follows. Section |3 gives definitions and notations. 
We prove in section [3] that, under certain conditions, CA 6 and 7 (using the 
Wolfram's numbering of ECA) are coalescing and show how to construct from 
them coalescing CA with arbitrarily many states. We also prove that CA 15 
and 170 either coalesce or enter total disagreement, each case occurring with 
probability | . In Section 0] we describe the exhaustive simulation study of all 
ECA and then check the directed percolation hypothesis. Moreover, we prove 
that some CA exhibit two phase transitions: one for small a and one for high a. 

2 Definitions and Notations 

In this paper, we consider the dynamics of some CA when they are run on an 
asynchronous mode. Let us start by defining the synchronisms we work with. 

Definition 1. An asynchronous finite CA is a tuple (Q, d, V, 5, n, fi) where 

— Q is the set of states; 

— d G N* is the dimension; 

— V — {vi, . . . , W|y|}, the neighborhood, is a finite set of vectors in ifi ; 

— 5 : £j)! y ! — > Q is the transition rule; 

— n e N* is the size; 

— U := (Z/nZ) d is the cell space (with periodic boundary condition); 

— fi, the synchronism, is a probability measure on {0,1}^. 

A configuration specifies the state of each cell, and so is a function c :IA — > Q. 

The dynamic is then the following. Let ct denote the configuration at time i, 
(co is the initial configuration). Let {M t \ t € N} be a sequence of independent 
identically distributed random variables with distribution fi. The configuration 
at time t + 1 is obtained by 



In other words, for each cell z, we apply the usual transition rule if M t (z) = 1 
and freeze it (keep its state) if M t (z) = 0. 




Here are the two synchronisms we use. They are the most natural, even 
if others are possible (like systematic or alternating sweep, or updating some 
cells more often, but this require non independent M t and so a more general 
formalism). If x G {0, 1} W , let |x|i be the number of 1 in the coordinates of x. 

The Partially Asynchronous Dynamic Let < a < 1. For each cell, we update 
it with probability a, independently from its neighbors, /i is thus the product 
measure of Bernoulli distributions: /i(x) :— a' x ' 1 (l — a)' 2 ' (with 0° = 1). The 
case a = 1 corresponds to the synchronous dynamic. 

The Fully Asynchronous Dynamic At each step, we choose one cell and up- 
date it. Which defines fx(x) as 1/n if \x\i = 1 and fi(x) := otherwise. 

We now introduce the definition of coalescing CA to formalize the observation 
of pQ. The principle is to use two initial configurations, and to let them evolve 
with the same outcome of the random variables {M t \ t £ N}. In other words, we 
use two copies of the CA, and at each time step, we update the same cells in both 
copies. This comes down to using the same source of randomness for both copies, 
like in .2 (on another system) where the authors observe a synchronization. 

Definition 2. An asynchronous finite CA is coalescing if, for any two initial 
configurations, applying the same sequence of updates leads both configurations 
to become identical within polynomial expected time (with respect to n). 

Any nilpotent CA (converging toward a configuration where all states are identi- 
cal) is coalescing if it converges in polynomial time. But there are non nilpotent 
coalescing CA, which we call non trivial. We now consider only those CA. 

In the following, we heavily use the simplest CA, namely the Elementary CA: 
one dimension (d = 1), 2 states (Q = {0, 1}), nearest neighbors (V = { — 1, 0, 1}). 
There are 2 8 = 256 possible rules, 88 after symmetry considerations. We use the 
notation introduced by S. Wolfram, numbering the rules from o to 255. 

3 Formal Proof of Coalescence 

In this section, we prove that there are infinitely many coalescing CA. For that, 
we prove the coalescence of two particular CA and show how to build an infinite 
number of coalescing CA from one of them. An easy way to do that last point 
would be to extend a coalescing CA by adding states that are always mapped to 
one states of the original CA, regardless of their neighbors. However, we consider 
such a transformation to be artificial since it leads to a CA that is in some sense 
identical. To avoid this, we focus on state minimal CA: CA in which any state 
can be reached. Note that among ECA, only and 255 are not state-minimal. 

We first exhibit two state- minimal coalescing CA (proposition^! ; then, using 
this result, deduce the existence of an infinite number of such CA (theorem 0; 
and finally describe the coalescent behavior of two others ECA (proposition^. 

Proposition 1. Rules 6 and 7 are coalescing for the fully asynchronous dynamic 
when n is odd. 



Proof. We call number of zones the number of patterns 01 in a configuration, 
which is the number of "blocks" of consecutive 1 (those blocks are the zones). 
We first consider only one copy (one configuration). 
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Here is the transition ta- 
ble of 6. Since one cell at 
a time is updated, and up- 
dating the central cell of 101 or 010 does not change its state, zones cannot 
merge. 

On each pattern 111, the central cell can be updated (lead- 
ing to the pattern 101) before its neighbors (with probability 
i) with expected time n. On each pattern 0001, the opposite 
sequence is possible. It happens without other update of the 
four cells with probability 1/4 3 and with an expected time of 
3n. So, as long as there are patterns 000 or 111, the number of zones increases 
with an expected time 0(n). Since there are 0(n) zones, the total expected time 
of this increasing phase is 0(n 2 ). 

The configuration is then regarded as a concatenation of words on {0, 1}*. 
Separation between words are chosen to be the middle of each pattern 00 and 11, 
so we get a sequence of words that have no consecutive identical letters, each word 
being at least two letter long (that is, words of the language "(01) + 0? | (10) + 1?"). 
We now show that borders between these words follow a one way random walk 
(towards right) and meet, in which case a word disappear with positive proba- 
bility. The CA evolves therefore towards a configuration with only one word. 

Updating the central cell of 100 docs not change its state, so the borders 
cannot move towards left more than one cell. On the other hand, updating the 
central cell of 001 or 110 make the border move. One step of this random walk 
takes an expected time 0(n). 

The length of a word also follows a (non-biased) random walk, which reaches 
1 after (on average) 0(n 3 ) steps, leading to the pattern 000 or 111. This pattern 
disappears with a constant non zero probability like in the increasing phase. The 
expected time for 0(n) words to disappear is then 0(n 4 ). 

Since n is odd, the two letters at the ends of the words are the same, i.e. there 
is one single pattern 00 or 11, still following the biased random walk. We now 
consider again the two copies. This pattern changes the phase in the sequence 
(01) + , it is therefore a frontier between a region where both configurations agree 
and a region where they do not. The pattern in the other configuration let us 
come back to the region where the configurations have coalesced. We study the 
length of the (single) region of disagreement. It follows a non biased random 
walk determined by the moves of both patterns. 



When this length reaches n, as opposite, the only ••• 10100101 •• 
change happens when the fourth cell is updated, and it ■•• 01011010 •• 
decreases the length. So, the random walk cannot indefinitely stay in state n. 



On the other hand, when the length reaches 1, one ■•■ 01010010 ■•■ 
possibility is the opposite, where updating the fifth cell " " ' 01011010 ••• 
leads to coalescence. 

The other possibility is opposite, where updating the ■•• 01011010 ••■ 
fifth then the fourth cell leads to the former possibility. " " ' 01001010 ••• 
In each case, coalescence happens with a constant non zero probability. One step 
of this random walk takes an expected time 0(n), the total expected time of the 
one word step is thus 0(n 3 ) (details on expected time can be found in 0). 

So rule 6 is coalescing. The only difference of 7 is that 000 leads to 010, which 
does not affect the proof (only the increasing phase is easier). □ 

Remark 1. If n is even, the proof is valid until there is only one word, at which 
point we get a configuration without 00 nor 11. There are two such configu- 
rations, if both copies have the same, it is coalescence, otherwise both copies 
perfectly disagree (definitively). Both happen experimentally 

Theorem 1. For the fully asynchronous dynamic, there are non trivial state- 
minimal coalescing cellular automata with an arbitrarily large number of states, 
and therefore infinitely many non trivial state-minimal coalescing CA. 

Proof. Let A 2 be the product of a CA A = (Q, d, V, 5, n, by itself, defined as 
(Q 2 , d, V, S 2 , n, n) where S 2 ((a,b), (c,d), (e,/)) := (S(a,c,e), S(b,d,f)). Intu- 
itively, A 2 is the automaton we get by superposing two configurations of A and 
letting both evolve according to 6, but with the same M t . If A is state-minimal, 
so is A 2 . 

Let A be a coalescing CA. Then A 2 converges in polynomial expected time 
towards a configuration of states all in {(q,q) | q S Q}. From this point, A 2 
simulates A (by a mere projection of Q 2 to Q) and is therefore coalescing (with 
an expected time at most twice as long); and so are (A 2 ) 2 , ((A 2 ) 2 )', etc. We 
have built an infinite sequence of CA with increasing size. □ 

Proposition 2. 15 and 170, for both asynchronous dynamics, coalesce or end 
in total disagreement, each case with probability 1/2. 

Proof. 170 (shift) means "copy your right neighbor" . The configurations agree 
on a cell if and only if they agreed on the right neighbor before this cell was 
updated. So, it is a CA with two states: agree or disagree. This CA is still 170. 
This rule converges in polynomial time towards 0* (corresponding to coalescence) 
or 1* (full disagreement) [3]. By symmetry, each case has probability 1/2. 

15 means "take the state opposed to the one of your right neighbor" , and 
the proof is identical (the quotient CA is still 170). □ 



4 Experimental Study and Phase Transition 

In this section, we describe experimental results in the context of partially asyn- 
chronous dynamic. We show that many EC A exhibit coalescence and make a 
finer classification. Specifically, we observe that some ECA undergo a phase 
transition for this property when a varies. We experimentally show that this 
phase transition belongs to the universality class of directed percolation. 



4.1 Classifying CA with Respect to Coalescence 

Protocol We call run the temporal evolution of a CA when all parameters (rule, 
size, a and an initial configuration) are chosen. We stop the run when the CA 
has coalesced, or when a predefined maximum running time has been reached. 

Let us describe the parameters we used. We set n = 500 and n = 2 000 and 
got the same results. |S] showed rigorously that a close to 1 (more updates) 
does not mean faster convergence (indeed, it is proportional to a (^_ a ) )- We thus 
repeat each run three times: for a £ {0.05, 0.50, 0.95}. The maximum number of 
time steps is equal to a few times n 2 . For each rule, we do 30 runs: 10 random 
initial configurations (to ensure coherence) times 3 values of a. 

Results We get the following empirical classes of behaviour: 

a/ Some CA never coalesce (or take a too long time to be observed): 4, 5, 12, 
13, 25, 28, 29, 33, 36, 37, 41, 44, 45, 51, 54, 60, 72, 73, 76, 77, 78, 90, 94, 104 
105, 108, 122, 132, 140, 142, 150, 156, 164, 172, 200, 204, 232. 
— Some CA coalesce rapidly. 

b/ The trivial way to do this is to converge to a unique fixed point. One 
can consider the two copies independently and wait for them to reach 
the fixed point, the CA has then coalesced. This is the case for 0, 2, 8, 
10, 24, 32, 34, 38, 40, 42, 56, 74, 128, 130, 136, 138, 152, 160, 162, 168. 
c/ The non trivial rules are 3, 11, 19, 35, 46, 154. 
d/ Some CA combine two of the three previous behaviors, depending on a. 18, 
26, 106, 146 combine a/ and b/; 50, 58, 134 combine b/ and c/; 1, 9, 27, 57, 
62, 110 and 126 combine a/ and c/ (see fig.Ql. 
e/ Some CA end in either full agreement between configurations (coalescence) 
or full disagreement, depending on the outcome of M t and the initial config- 
uration: 14, 15, 23, 43, 170, 178, 184. 
f/ 6 combines the previous point (for small a) with b/, 7 do the same but with 
c/, 22 and 30 combine it (for small a) with a/. 

Let us to study the phase transition (coalescence or not) when a changes, that is, 
rules 1, 9, 27, 57, 62, 110 and 126. We test the hypothesis of directed percolation. 
Some rules (9, 110, i26) show two phase transitions, one for low a, noted ge, 
one for high a, noted gh- The ones for low a (gt, 110^ and 126^) are "reversed", 
that is, coalescence (sub-critical regime) occurs for higher a. 57 is also reversed. 

4.2 Directed Percolation 

Due to lack of space, we refer to 0] for a presentation of directed percolation, 
which also explains damage spreading, another point of view on this phenomenon. 

Our active sites are the cells where the configurations disagree (density of 
such sites is written p). Percolation appears when varying a, see fig-O The aim 
is thus to identify /3 assuming that p(ct) = c(a — a c )@ for some c and a c . 



Fig. 1. (color online) rule no, n = 500. Time goes from left to right, during 500 
steps. Active sites are dark, coalesced site are light (with light blue standing for 
state 1, white for 0). Left: sub-critical phase (a — 0.47 < a c ~ 0.566), branches 
die. Right: supercritical phase (a = 0.65 > a c ), active sites spread 

Measure of a c We use the method described in 4 : plot the density p of 
active sites versus time in logarithmic scale and find the a value for which one 
gets a straight line (for a < a c , the AC coalesce faster, for a > a c , it has a 
positive asymptotic p). We used random initial configuration with each state 
equiprobable. To get readable plots we needed up to n = 10 6 cells and 10 7 time 
steps. We get (recall that a c is not universal, it is just used to compute (i): 



rule 


l ge g h 17 62 noe no h iz6 e n6 h 57 


a c > . . . 
a c < ... 


0.102 0.073 0.757 0.856 0.598 0.073 0.566 0.101 0.720 0.749 
0.103 0.074 0.758 0.858 0.599 0.075 0.567 0.102 0.721 0.750 



Note that the a c of 1 and 126^, like gi and no^, are very close. 



Measure of j3 We now plot p vs a (fig. |2J). The assumption p — c(a — a c )^ 
is valid only near a c . To determine which points should be taken into account, 
we plot p(a) on a logarithmic scale with x-origin roughly equal to a c (precision 
does not affect the result). We keep only the beginning of the curve which is a 
straight line. We varied the number of points taken into account to estimate the 
loss of precision due to this choice. 

Protocol n = 10 000. We let the system evolve for T tr = 100 000 steps, then 
measure p during T samp = 10 000 step and compute the average. We repeat 
such a run for each a value with a fine sampling. The fact that the curve is 
smooth (except very near a c ) tells us that measures do not depend much on the 
randomness (nor on T samp ) and that we do not need error bars. We also checked 
that the results do not vary when we change n and T tr . 
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Fig. 2. p(a). Rule no, n = 10 000. Fitted curve agrees with points used for the 
fit (bold). 



The fit gives the following ranges, taking into account uncertainty about a c 
and which points to keep for the fit. Experimental value for (3 measured on other 
systems is 0.276. 



rule 


l g e g h 27 62 noe no h n6e iz6 h 57 


(3>... 
P<... 


0.265 0.270 0.273 0.258 0.270 0.270 0.271 0.250 0.260 0.248 
0.279 0.295 0.283 0.305 0.281 0.291 0.281 0.276 0.276 0.281 



As expected, this model seems to belong to the universality class of directed 
percolation (except perhaps 27, due to higher noise and thus lack of precision). 
Source code is available on cimula.sf.net. 
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